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Abstract 


Several  problems  of  applied  probability  lead  to  the 
investigation  of  extremes  of  a sequence  of  random  variables 
(floods,  strength  of  materials,  failure  models,  maximal  oueuo 
length,  air  pollution  data,  etc.).  It  is  unrealistic  in  most 
cases  to  assume  that  the  random  variables  in  question  are 
independent  and  identically  distributed.  However,  there  are 
several  dependent  models  when  the  distribution  of  the  k-th 
largest  is  well  approximated  by  the  distribution  of  the  k-th 
largest  of  independent,  and  some  times  independent  and  iden- 
tically distributed  random  variables  (possibly  changing  the 
fixed  size  of  the  sequence  in  question  to  a random  one).  See 
the  book  below,  particularly  Sections  3.1,  3*12,  6.1  and  6.2, 
in  which  both  the  possibility  of  such  approximations  and  the 
limit  lows  when  such  approximation  foils  are  discussed. 

We  shall  describe  several  models  with  emphasis  on  the 
possibility  of  approximations  by  independent  random  variables. 

In  addition,  we  shall  analyze  the  following  embarrassing  ques- 
tion of  an  applied  scientist:  in  a sample  of  size  n = 1,000  , 
say,  is  the  10-th  largest  an  extreme  in  the  sense  that  k=10 
is  fixed  as  n increases  (if  n = 1,000  is  not  large  enough, 
take  n = 10^),  or  should  one  consider  the  10-th  largest  as 
the  lyfn  - th  largest  ? Since  the  limiting  distributions  are 
different  in  the  two  cases,  the  decision  on  the  actual  case  to 
be  applied  is  significant. 

Through  this  analysis,  we  shall  point  otat  several  important 
'•rules"  for  applying  asymptotic  extreme  value  theory. 

Reference 

Galambos,  Janos  (1978).  The  Asymptotic  Theory  of  Extreme  Order 
Statistics.  John  Wiley  & Sons,  Inc.,  Hew  York. 


1.  Stochastic  Model  Building 


1.1.  General  comments.  One  of  the  most  crucial  part  of  an 
applied  scientist's  decision  is  the  adoptation  of  a stochastic 
model  for  the  random  quantities  X, ,X0....,  X he  is  to  deal 
with.  Whether  t.hese  random  quantities  are  produced  by  nature 
(floods,  winds,  etc.),  or  by  a monufactur ing  procedure  (life 
lengths  of  comnonents  of  an  equipment),  or  the  statistician 
collects  them  by  random  sampling,  it  is  very  rare  that  a sci- 
entific reasoning  would  lead  to  a unique  dependence  structure 
and  a well  defined  family  of  distributions  for  the  X's.  Con- 
sequently, the  basic  underlying  dependence  and  distributional 
assumptions  are  subjective  to  a large  extent.  The  question 
thus  aris.es  whether  substantially  different  conclusions  can 
be  reached  by  two  scientists,  assuming  that  both  of  them  work 
under  "reasonable  assumptions"?  Here  "reasonable"  means  that 
the  assumptions  are  compatible  with  general  practice  in  the 
scientific  literature. 

For  an  outsider,  the  fact  alone  that  thin  question  arises 
should  be  shocking  who  trusted  us  that  scientific  decisions 
ought  to  be  unquestionable.  Unfortunately,  not  just  the  ques- 
tion aris.es  but  the  answer  to  it  is  the  real  disappointment: 
the  subjectivity  of  model  building  does  influence  the  other- 
wise uniouely  determined  scientific  procedure  of  decision 
making.  Therefore,  the  choice  of  the  model  has  to  be  based  on 
more  careful  studies  than  a routine  acceptance  of  independence 
and  a family  of  distributions  (normality  or  other  popular  ones). 
One  cannot  expect  to  build  a theory  of  model  building  with  a 
general  appeal,  since  the  practical  problem  to  be  settled  should 
be  a major  consideration  in  adopting  n model.  Evidently,  one 
has  to  be  more  accurate  when  human  life  is  involved  such  as 
effects  of  food  additives,  drugs  and  medical  treatment  or 
safety  of  equipments.  Financial  and  legal  considr rations  also 
increase  the  demand  for  more  accurate  models.  But  whether  newer 
approaches  ore  morn  accurate  is  very  doubtful.  The  almost  daily 
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rejection  of  food  additives  on  the  bane  of  moving  aw ay  from 
the  classical  assumption  of  normality  to  n new  underlying 
distribution  does  not  correct  the  error  of  the  model  by  which 
these  seme  additives  were  accepted  earlier  as  safe.  Neither 
did  the  model  used  for  the  evaluation  of  air  pollution  data 
become  more  accurate  by  changing  the  assumption  concerning 
the  distribution  of  pollutant  concentration  first  from  normal 
to  lognormal  and  more  recently  to  other  families,  mainly  mixtures. 
One  general  rule  can,  however,  be  set  up:  decisions  of  major 
importance  should  be  reached  after  the  data  has  been  evaluated 
in  several  alternate  models. 

Examples  in  subscouent  sections  will  support  all  claims 
of  this  introduction. 

1.2.  Stochastic  models  with  extremes.  In  the  book,  Galambos  (1978), 
the  present  author  described  and  developed  several  dependent 
models,  when  the  extremes  govern  the  laws  of  interest.  The  aim 
of  the  present  paper  is  to  summarize  some  of  these  available 
models  an,d  to  point  to  an  important  direction  of  future  de- 
velopment. 

Let  us  start  with  a list  of  applied  fields  where  the  solu- 
tion is  in  terms  of  the  maximum  Z^  or  minimum  V/^  of  the 
basic  random  quantities  X^,  l$J<n,  associated  with  the  actual 
practical  problem. 

The  annual  flood  on  a river  is  measured  as  the  maximum 

Z of  the  daily  highest  water  levels  X.,  j^l.  The  X's  here 
n j 

are  dependent  and  closer  the  observations  are  in  time,  the  stronger 
the  dependence  is.  Within  a season,  and  at  some  locations  through- 
out the  year,  the  X's  can  be  assumed  to  be  identically  distributed. 
In  order  to  plan  for  building  a dam  on  a river  at  a given  location, 
an  accurate  description  of  Z^  is  reouired.  If  two,  or  if  several 
dams  are  to  be  built  on  the  same  river,  then  the  X's  have  to  be 
measured  at  those  selected  locations  and  our  interest  is  the 


mu 


Itivariate  distribution  of  the  correspond ing  Z's. 


Notice  that  a choice  of  a mathematical  model  for  the  de- 
scription of  the  annual  flood  is  enuivalont  to  accepting  a concept 
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for  a weakening  dependence  ns  time  passes  and  a choice  for  the 
common  distribution  of  the  X's  in  a reason  or  for  the  year  nr 
the  care  may  be.  Because  other  applied  fields  lead  to  this  same 
mathematical  problem,  let  us  go  ahead  with  the  list  mentioned 
earlier. 


The  random  strength  of  a sheet  of  certain  material  under 

stress  can  be  expressed  as  a minimum  os  follows.  Let  us 

hypothetically  subdivide  the  sheet  into  n parts.  If  X.  is 

J 

the  strength  of  the  j-th  part  in  thir  subdivision,  then  the 
so  called  weakest  link  principle  tells  us  that  the  strength  of 
the  original  sheet  is  indeed  W of  these  X..  Again,  those 

* * J 

X^  which  represent  the  strength  of  distant  parts  hove  weaker 
dependence  than  neighbouring  ones  and  in  fact  some  kind  of  almost 
independence  is  valid  as  distance  increases  indefinitely.  Here, 
unit  is  proportional  to  the  reciprocal  of  the  souere  root  of  n 
(make  the  hypothetical  subdivision  of  a rectangle  in  such  a way  th* t 

p 

both  edges  ore  divided  into  m equal  parts  and  set  n = m' ) . 

If  the  division  is  done  into  equal,  parts,  then  the  X's  can 
again  be  assumed  to  be  identically  distributed. 


The  time  to  the  first  failure  of  a complicated  equipment  is 
best  approached  by  first  grouping  components  into  so  called  cuts 
or  pathee.  By  such  an  approach,  an  arbitrary  equipment  becomes 
equivalent  to  a parallel  or  series  system.  However,  while  parallel 
systems  are  installed  for  safety  considerations  and  thus  they  are 
expected  to  function  independently,  the  above  reduction  procedure 
can  lead  to  very  complicated  dependence  structures.  The  actual 
grouping  procedure  is  as  follows.  A path  is  a set  of  components 
whose  functioning  insures  the  functioning  of  the  equipment.  A 
minimal  path  is  a path  whose  number  of  elements  cannot  be  redticed 
without  violating  the  defining  property  of  a path.  On  the  other 
hand,  a cut  is  a set  of  components  whose  failure  causes  the  equip- 
ment to  fail. Pina] ]y,  a minimal  cut  is  a cut  with  n minimal  number 
of  elements.  Now  if  Xj  represents  the  random  time  to  the  first 
failure  of  the  j-th  minimal  path  of  the  equipment,  then  Z^  is 


the  time  to  the  first  failure  of  the  equipment.  This  seme 
random  waiting  time  until  the  equipment  fails  can  also  be 
expressed  as  the  minimum  V/0  of  the  random  times  unto  the 
failures  of  minimal  cuts.  Since  either  n or  s is  large 
for  a complicated  equipment,  its  failure  is  well  described 
by  an  asymptotic  model.  However,  since  different  path  sets 
may  contain  several  common  components,  the  assumption  of 
independence  of  the  is  unjustified  even  in  an  approxi- 

mate sense.  Evidently,  a similar  remark  applies  to  cuts  as 
well. 

Here  we  cannot  speak  of  close  and  distant  neighbours 
of  path  nets  and  yet  the  dependence  structures  of  the  three 
problems  listed  can  be  described  by  a unified  model.  What 
will  be  essentially  different  in  the  failure  model  when 
compared  with  the  models  for  strength  or  flood  that  a single 
distribution  cannot  be  used  for  all  path  sets.  We  should 
rather  seek  a mathematical  solution  when  each  path  net  is 
permitted  to  have  different  failure  distribution.  Strangely 
indeed,  this  generality  leads  to  the  mathematical  conclusion 
that  engineers  applied  all  along:  any  distribution  with  mo 
notonic  hazard  rate  is  an  asymptotic  failure  distribution. 

The  unified  dependence  model  for  the  three  listed  problems 

is  given  by  a set  of  random  variables  X^,  Xg,...,  X^  which 

form  a so  called  E*-senuence.  Since  it  is  accurately  defined 

on  pp. 176-177  of  the  author's  book  referred  to  earlier,  we 

describe  it  here  in  vague  terms  only.  The  reauirement  is  that 

a net  E*  of  exceptional  pairs  (X.,X.)  can  be  found  v/ith 
n i j 

which  the  following  three  basic  properties  hold  for  the  original 

sequence  X.,  1 ^ j<ns 
J 

(i)  the  events  ^X..£xj  are  asymptotically  independent 

t 

as  x becomes  "large",  whenever  no  pairs  of  the 
subscripts  are  exceptional; 

(ii)  if  exactly  one  pair,  (im,it)  nay,  is  exceptional 

among  the  subscripts  of  X^  , l$s<k,  then  the 

xn 


b 


probability  of  the  intersection  of  the  events  £x.  > x? 

3 s * 

is  proportional  to  the  product  of  F(X.  > x,  X.  > x) 

^vx"  Jt  " 

and  the  univariate  tails  P(X.  > x) , s ^ m,t; 

a 

and  ^ 

(iii)  the  number  of  elements  in  the  exceptional  set  E 

2 n 

is  of  smaller  magnitude  than  n , which  is  the  number 
of  all  pairs  of  the  subscripts  of  the  X^  . 

Under  those  conditions,  the  univariate  and  bivariate  marginals 

of  the  X ^ determine  whether  the  maxima,  when  normalized,  have 

a limiting  distribution  and  when  their  do,  what  is  the  actual 

form  of  this  limiting  distribution.  There  is  a simple  criterion 

available,  and  which  is  of  wide  applicability  to  applied  questions, 

which  guarantees  that  the  limiting  distribution  of  the  normalized 

maximum  for  an  E -sequence  exists  and  this  limiting  distribution 

n 

is  the  same  as  if  the  X.  were  completely  independent.  However, 

J 

when  this  condition  fails,  the  limiting  distribution  may  still 
exist,  but  the  corresponding  theory  is  not  well  developed  lor 
such  case. 

This  unified  approach  through  E*-seouences  covers  several 
well  developed  dependence  conoepts  ouch  as  m-dependence  and 
variants  of  mixing.  As  was  emphasized  earlier,  the  basic  assump- 
tions of  a model  are  mainly  a matter  of  belief  when  applied  in 
a concrete  sitiiation;  the  concept  of  an  En~sequence  is  developed 
exactly  with  this  dilemma  in  mind.  While  the  special  case  of  a 
mixing  concept,  say,  requires  the  validity  of  a weakening  de- 
pendence for  certain  sequences  of  sigma  fields  as  the  major 
assumpti on,  our  only  assumption  that  cannot  be  checked  is  (ii). 
Namely,  one  can  construct  the  set  E*  in  such  a way  that  (i) 
and  (iii)  be  automatically  satisfied.  This  construction  of  E* 
for  the  three  problems  mentioned  is  self  evident.  For  the  flood 
end  strength  models,  E*  will  contain  any  pairs  which  represent 
"close  neighbours",  while,  for  the  failure  model,  (i,j)  belongs 
to  E*  if  X..  and  Xj  represent  life  length  of  such  path  seta 
which  have  common  elements. 


I 


7. 


Another  approach  to  failure  models  is  possible  through 
exchangeable  variables.  It  in  quite  a surprising  result  that 
a kind  of  averaging  is  possible  in  such  a sensitive  field  as 
extreme  value  theory  (nee  Sections  3»1  and  3*2  of  the  book 
by  Goiarabos  (1978)).  We  do  not  discuss  this  approach  here, 
but  we  wish  to  draw  attention  to  it  because  of  its  varied 
possibility  for  further  investigations. 

We  conclude  thin  section  by  mentioning  two  approaches 
to  specific  problems  of  extreme  value  theory  which  bear  in- 
teresting mathematical  facts.  One  is  the  approach  by  C.C.  Heyde 
(1971)  to  queue  length,  where  dependent  variables  arc  "transformed' 
to  independent  ones  (but  with  a random  number  of  elements).  The 
other  one  is  the  classical  approach  by  H.E.  Daniels  (1945)  to 
strength  of  bundles  of  threads,  in  which  the  opposite  is  done: 
originally  independent  variables  are  "transformed" to  dependent 
ones.  Interestingly,  these  dependent  variables  can  also  be  re- 
presented as  a function  of  indenendent  random  variables  of  random 
size,  namely,  a quantile  with  random  index.  However,  no  attempt 
has  been  made  so  far  to  develop  the  theory  of  strength  of  bundles 
on  this  latter  line. 

Further  important  applications  are  touched  upon  in  the  next 
section  where  we  discuss  the  effects  of  the  choice  of  a popula- 
tion distribution. 

2.  The  choice  of  a population  distribution 

We  speak  of  a population  distribution  when  a random  phenomenon 
can  be  described  by  a sequence  of  identically  distributed  random 
variables  7-j,  X2,...,  Xn»  This  is  the  case  of  floods,  strengths 
(both  sheets  and  bundles),  certain  failure  models  (e.g.  parallel 
and  series  systems),  permitted  level  of  air  pollution  concentration 
(the  US  Standard  is  set  by  the  second  largest  observation)  and 
effects  of  food  additives  or  drugs  on  persons  belonging  to  a group 
of  identical  physical  conditions.  Notice  that  independence  is 
not  emphasized.  However,  in  the  analysis  that  follows  it  will  be 
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on  occasion  easier  to  stress  a point  by  assuming  that  an 
approximation  by  independent  variables  in  possible. 

Let  up  put  down  come  notations.  The  basic  random  variables 
are  denoted  by  X^,  X?,...,  Xn  . They  are  assumed  to  be  iden- 
tically distributed  and  we  use  F(x)  for  their  common  distribu- 
tion function.  We  put  Z and  W for  the  maximum  and  the 

n n 

minimum,  respectively,  of  the  X's,  whose  distribution  functions 
are  Hn(x)  and  Ln(x),  respectively.  Finally,  the  empirical 
distribution  function  of  the  X*c  is  denoted  by  Fn(x).  In  the 
frequency  approach  to  probability,  Fn(x)  has  to  be  a pood  approxi- 
mation to  F(x).  But  can  we  rely  on  any  good  approximation  to 
F(x)  if  our  interest  is  ( x ) ? As  we  shall  see, no  classical 
approximation  is  acceptable  and  thus  specific  techniques  are 
needed  in  extreme  value  theory.  Of  course,  if  we  know  F(x) 
accurately  through  a so  called  characterization  theorem  (for 
questions  of  normality,  see  the  book  by  Kogan,  Linnik  and  Rao 
(1973)  and  for  other  distributions,  the  one  by  Galcmbos  and 
Kotz  (1978)),  then  no  statistical  method  is  needed. 

First  let  us  look  at  the  meaning  of  the  empirical  distribu- 
tion function.  By  definition,  Fn(x)  = 1 for  all  x>Z^  • The 
meaning  of  this  fact  is  that  the  jump  of  Fn(x)  at  2 represents 
the  increament  1 - F(x)  of  the  population  distribution  for  an  x 

close  to  Z but  the  actual  variation  of  the  tail  1 - F(x)  is 
n 

not  approximated  by  Fn(x)  for  x*s  exceeding  Z But  since 

the  tail  1 - F(x)  alone  determines  Hn(x)  for  large  values  of 

n,  whenever  the  basic  random  variables  are  almost  independent, 

we  simply  cannot  expect  Fn(x)  to  n puide  in  approximating 

H (x).  Now  wo  just  have  to  observe  that,  both  the  chi-squared  test 
n 

for  goodness  of  fit  and  the  more  frequently  used  fitting  by 
a specific  probability  paper  are  based  on  F (x)  and  w©  can  thus 
conclude  that  classical  methods  are  inapplicable  to  approximate 
F( x)  in  extreme  value  theory.  In  order  to  be  more  specific,  let 
us  look  at  a numerical  example. 

Assume  that  a random  quantity  X is  associated  with  the 
operation  of  an  equipment  (or  the  human  body).  The  equipment  fails 
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if  X exceeds  3300  and  its  operation  in  safe  in  the  range 
2000  to  3?00.  We  made  n = 500  observations  on  X and  all  were 
in  the  safety  range.  In  fact,  = 2342  and  = 3071. 

Furthermore,  the  X’o  were  varying  around  the  mean  2698  with 
standard  deviation  104,  and  the  reduced  values  (X-2698)/l04 
gave  a very  good  fit  on  0 standard  normal  probability  paper. 

Is  it  now  justified  to  use  a normal  table  to  comnute  the  safety 
of  100,000  eouipments,  say  ? If  a failure  means  only  a financial 
loss,  then  not  much  problem  in  involved:  one  computes  the  expected 
number  of  losses,  includes  them  in  the  price  and  gives  a warranty 
against  loss.  However,  if  a single  failure  means  disaster  (loss 
of  life  or  permanent  damage  to  humans),  then  we  have  to  determine 
the  population  distribution  F accurately  enough  that  the  value 
100,000(1  - F( y ) ) with  y =( 3300-2698)/l04=5 . 788  be  negligible. 

Since  our  observations  do  not  give  any  information  on  F(y)  beyond 
y = ( 3071-?698)/l04  = 3*587,  a fitting  based  on  these  observations 
is  insufficient  for  our  purposes.  What  we  actually  want  is  a 
good  approximation  to  H (x)  with  s = 100,000.  A separate  paper 

f) 

is  devoted  to  this  problem;  see  Galambos  (1978a). 

3.  The  accuracy  of  an  approximate  model 

The  discussion  in  the  preceding  section  already  shows  that 
accuracy  is  more  significant  in  extreme  value  theory  than  in 
most  classical  statistical  problems.  Unfortunately,  very  few 
speed  of  convergence  estimates  are  available.  There  is  actually 
only  one  result  on  this  line  which  first  appeared  in  the  author's 
book  (Section  2.10).  That  result  can  also  be  extended  to  En-senuonce: 
without  nruch  effort.  We  do  not  reproduce  those  results  here,  knther 
we  would  describe  another  problem  that  also  leads  to  a problem  of 
accuracy. 

In  asymptotic  theory  one  distinguishes  between  upper  extremes 
and  large  order  statistics  depending  whether  the  rank  of  an  order 
statistic  is  fixed  or  varies  with  the  sample  size.  More  precisely, 
let  X,  . be  the  k-th  largest  Rrnong  n observations.  If  k is 
fixed  as  n increases  indefinitely,  then  is  called  the 
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k-th  upper  extreme.  On  the  other  hand,  if  k varies  with  n 
but  k/n  converges  to  one  as  n goes  to  infinity,  then  we 
refer  to  X,  . as  a large  order  statistic.  Now,  in  several 
applied  fields  one  faces  the  problem  of  evaluating  the  behaviour 
of  X.  where  k is  small  and  n is  large.  Which  case  applies 
if  nsymntotic  theory  is  to  be  the  basic  tool  ? For  example,  if 
k = 10  and  n = 1,000,  is  X,  an  uprer  extreme  or  a large 
order  statistic  in  the  above  sense  ? The  distinction  is  significant 
because  in  the  case  of  extreme,  a gamma  type  distribution  is 
obtained  for  the  limiting  case  while  with  varying  k (in  our  case 
^"rT  ) , the  limiting  distribution  is  related  to  the  normal  distribu- 
tion. Since  the  parameters  are  small  here,  a normal  approximation 
is  not  justified  for  the  gamma.  The  answer  is  not  an  easy  one  and 
a general  answer  cannot  be  given.  When  one  decides  to  use  one  case 
or  the  other,  the  accuracy  of  the  approximation  has  to  be  estimated 
This  can  only  be  done  if  the  population  distribution  is  known. 

If  not,  then  a method  similar  to  the  one  described  for  the  maximum 
in  Galambos  (1978a)  should  be  developed.  No  result  seems  to  be 
available  on  this  line. 
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